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STIC Biotechnology Systems Branch 

RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 

Source: /^T 

Date Processed by STIC: / 6 I fti, 1 2~OOS~ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTBN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2.2 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 

U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, V A 22314 

Revised 01/24/05 
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8 



11 



'invalid Line Length fhe rules require that a line not exceed 72 charactersjh:length. This includes white spaces. 



— IV 

Misaligned Amino The numbering under each 5* axro^acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 




Non-ASCII 



The submitted file was not saved iHASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 



Patentln 2.0 
* "bug" 



Variable Length Sequence(s) contain ns or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

^Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ED NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

^Skipped Sequences Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 

(NEW RULES) <210> sequence id number 

<400> sequence id number 
000 

Use of n s or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listmg. 

"(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



10 Invalid <213> 

Response 



Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2 1 3> response is Unknown or 
is Artificial Sequence 



Use of<220> 



Sequence(s) 



missing the <220> "Feature" and associated numeric identifiers and response: 



12 Patentln 2.0 

"bug" 



Use of <220> to <223> is MANDATORY if <21 3> "Organism" response is ^Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register "06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file : 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



1 3 Misuse of n/Xaa 'n" can only represent a single nucleotide . ; Xaa v can only represent a single amino acid 

AMC - Biotechnology Systems Branch - 09/09/2003 
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RAW SEQUENCE LISTING DATE: 10/14/2005 

PATENT APPLICATION: US/10/551, 508 TIME: 11:04:23 

Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 

3 <110> APPLICANT: KIM, Young Tae 

4 LEE, Jae Hyung 

7 <120> TITLE OF INVENTION: Gene involved in the biosynthesis of carotenoid and marine 

8 microorganism, paracoccus haeundaesis, producing the 

9 carotenoid 

11 <130> FILE REFERENCE: 428.1056 

13 <140> CURRENT APPLICATION NUMBER: US/10/551,508 
13 <141> CURRENT FILING DATE: 2005-09-29 

13 <150> PRIOR APPLICATION NUMBER: PCT/KR2004/000752 

14 <151> PRIOR FILING DATE: 2003-03-31 

16 <150> PRIOR APPLICATION NUMBER: KR2003-20222 

17 <151> PRIOR FILING DATE: 2003-03-31 

19 <150> PRIOR APPLICATION NUMBER: KR2003-20023 

20 <151> PRIOR FILING DATE: 2003-03-31 
22 <160> NUMBER OF SEQ ID NOS : 18 

24 <170> SOFTWARE: Kopatentln 1.71 



ERRORED SEQUENCES 

52 <210> SEQ ID NO: 3 

53 <211> LENGTH: 1454 

54 <212> TYPE: DNA 

55 <213> ORGANISM: Paracoccus haeundaesis 

57 <400> SEQUENCE: 3 

58 caacttgaga gtttgatcct ggctcagaac gaacgctggc 
60 gtcgagcgag accttcgggt ctagcggcgg acgggtgagt 
62 tctctacgga atagccccgg gaaactggga gtaataccgt 
64 atttatcgga gaaggatcgg cccgcgttgg attaggtagt 
66 aagccgacga tccatagctg gtttgagagg atgatcagcc 
68 cccagactcc tacgggaggc agcagtgggg aatcttagac 
70 agccatgccg cgtgagtgat gaaggcctta gggttgtaaa 
72 aatgacggta ccagcagaag aagccccggc taactccgtg 
74 gagggggcta gcgttgttcg gaattactgg gcgtaaagcg 
76 tcagaggtga aatcccaggg ctcaaccttg gaactgcctt 
78 tcgagagagg tgagtggaat tccgagtgta gaggtgaaat 
80 accagtggcg aaggcggctc actggctcga tactgacgct 
82 gcaaacagga ttagataccc tggtagtcca cgccgtaaac 
84 aagcatgctt gtcggtgtca cacctaacgg attaagcatt 
86 caagattaaa actcaaagga attgacgggg gcccgcacaa 
88 attcgaagca acgcgcagaa ccttaccaac ccttgacatg 
90 cagctttctc gtaagagacc tgcacacagg tgctgcatgg 
92 gagatgttcg gttaagtccg gcaacgagcg caacccacgt 



GotnpW 



ggcaggctta 
aacgcgtggg 
atacgccctt 
tggtggggta 
acactgggac 
aatgggggca 
gctctttcag 
ccagcagccg 
cacgtgggcg 
tgaaactatc 
tcgtagatat 
gaggtgcgaa 
gatgaatgcc 
ccgcctgggg 
gcggtggagc 
gcaggaccgc 
ctgtcgtcag 
ccctagttgc 



acacatgcaa 
aacgtgccct 

tgggggaaag 

atggcccacc 
tgagacacgg 
accctgatct 
ctgggaagat 
cggtaatacg 
gactggaaag 
agtctggagt 
tcggaggaac 
agcgtgggga 
agacgtcggc 
agtacggtcg 
atgtggttta 
tggagagatt 
ctcgtgtcgt 
cagcattcag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/551,508 



DATE: 10/14/2005 
TIME: 11:04:23 



Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 



E--> 



E--> 



E 
E 
E 
E 
E 
E 



- - > 



- - > 



- - > 



- - > 



- - > 



- - > 



E--> 



94 ttgggcactc tatggaaact gccgatgata agtcggagga aggtgtggat gacgtcaagt 1140 
96 tctcatggcc cttacgggtt gggctacaca cgtgctacaa tggtggtgac agtgggttaa 12 00 
98 tccccaaaag ccatctcagt tcggattgtc ctctgcaact cgagggcatg aagttggaat 1260 
100 cgctagtaat cgcggaacag catgccgcgg tgaatacgtt cccgggcctt gtacacaccg 1320 
102 cccgtcacac catgggagtt ggttctaccc gacgacgctg cgctaacctt cggggggcag 1380 
104 gcggccacgg taggatcagc gactggggtg aagtcgtaac aaggtagccg taggggaacc 1440 



cggctgga tcac 




106 

107 

325 <fZT0> SEQ ID NO: 5 

326 <211> LENGTH: 729 - 

327 <212> TYPE: DNA 

32 8 <213> ORGANISM: crtW gene 

330 <400> SEQUENCE: 5 

331 atgagcgcac atgccctgcc caaggcagat ctgaccgcca ccagcctgat cgtctcgggc 
333 ggcatcatcg ccgcgtggct ggccctgcat gtgcatgcgc tgtggtttct ggacgcggcg 
335 gcgcatccca tcctggcgat cgcgaatttc ctggggctga cctggctgtc ggtcggtctg 
337 ttcttcatcg cgcatgacgc gatgcacggg tcggtcgtgc cggggcgtcc gcgcggcaat 
339 gcggcgatgg gccagctggt cctgtggctg tatgccggat tttcgtggcg caagatgatc 
341 gtcaagcaca tggcccatca ccgccatacc ggaaccgacg acgaccccga tttcgaccat 
343 ggcggcccgg tccgctggta cgcgccjcttc atcggcacct atttcggctg gcgcgagggg 
345 ctgctgctgc ccgtcatcgt gacggtctat gcgctgatcc tgggggatcg ctggatgtac 
347 gtggtcttct ggccgctgcc gtcgatcctg gcgtcgatcc agctgttcgt gttcggcacc 
349 tggctgccgc accgccccgg ccacgacgcg ttcccggacc gccataatgc gcggtcgtcg 
351 cggatcagcg accccgtgtc gctgctgacc tgctttcact ttggtggtta tcatcacgaa 
353 caccacctgc acccgacggt gccttggtgg cgcctgccca gcacccgcac caagggggac 

355 accgcatga 

356 <f2j£ 

359 <210> SEQ ID NO: 6 



360 <211> LENGTH : QA 

361 <212> TYPE: PRT 

362 <213> ORGANISM: crtW amino acid 

364 <400> SEQUENCE: 6 

365 Met Ser Ala His Ala^ Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser 

366 1 CfP [O @) ( " 

368 He Val Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His V3I His 

369 * ^O®. ' \ ^jgSg) * i>0 (5$ 

J a, He Ala 



-7 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 




sr Leu 




3 71 Ala Leu Trp. EJhe^Leu Asp Ala Ala Ala His Pro He LeuAl 
372 * . $0 <f§) M^L 

374 Asn Phe Leu Gly Leu Thr Trp^Leu Ser Val Gly Leu Phe Phe He Ala 

375 - ' ■ s*> djb ' s <® 

377 His Asp, Ala Met His Gly Ser Val Val Pro Gly Arg. Pro Arg Gly Asn 

378 <@ - * 9©<f|!) ' ® 

380 Ala Al,a Met Gly Gin Leu VaJ Leu Trp Leu Tvr Ala Gly 

381 <RD</,4£> - 

383 Arg Lys Met He Val Lys His Met AlaHis. His Arg His Thr Glv, T 

384 ' iqgltf" fC5i/ L05-' {I'D "' ' 

386 Asp Asp Asp P-*-o Asp Phe Asp His GlyGly Pro Val Arg Trp JTyr Ala 

387 *< i^c^ l>S~l <U5? 

389 Arg Phe He Gly Tar Tyr Phe Glv Trp Arg Glu Gly Le.u Leu Leu Pro 



Phe Ser^Tm 
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PATENT APPLICATION: US/10/551,508 TIME: 11:04:23 



Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 



Val lie Val Thr Val Tyr Ala Leu lie Leu Gly A^.d Arg Trp Met Tyr* 





175 





E--> 390 130 135 140 

392 
E--> 393 

395 Val Val Phe Trp Pjra. Leu~Pro Ser lie Leu Al,a SeiT lie Gin Leu Phe 

e--> ase ; " ' M<gp ' }p€>\ %f$> ' lifr"' 

398 Val Phe Gly Thr Trp leu Pro His Arg Pro (iiy His Asp Ala Phe Pro 

E--> 399 -tfWv". ' ' " W & 190 ' \ ~ 

401 Asp Arg His Asn ^Ala Arg Ser Ser Arq lie Ser Asp Pro Val_Ser Leu 

e--> 402 * / * &> v i^r-^ 

404 Leu Thr Cvs Phe His Pije Glv GlyTyr His His Glu His' rixs Le.u Hi-.s 
E--> 405 k 2/T(21S) ^gj^ 

407 Prq ^hr Val Pro Trp Trp Ar;g Leu Pr;o Ser Thr ^r gT l^r Lys Gly Asp. 
E--> 408 <Zp> ; ' 2>4f ^fV 

410 ^hr; Ala _ V . 

414 ; <llj> SEQ ID NO: 7 

415 <211> LENGTH: 489 

416 <212> TYPE: DNA 

417 <213> ORGANISM: crtZ gene 

419 <400> SEQUENCE: 7 

420 atgaccaatt tcctgatcgt cgtcgccacc gtgctggtga tggagttgac ggcctattcc 60 
422 gtccaccgtt ggatcatgca cggccccctg ggctggggct ggcacaagtc ccaccacgag 120 
424 gaacacgacc acgcgctgga aaagaacgac ctgtacggcc tggtctttgc ggtgatcgcc 180 
426 acggtgctgt tcacggtggg ctggatctgg gcgccggtcc tgtggtggat cgctttgggc 240 
428 atgaccgtct atgggctgat ctatttcgtc ctgcatgacg ggctggttca tcagcgctgg 300 
430 ccgttccgct atatcccgcg caagggctat gcccgccgcc tgtatcaggc ccaccgcctg 360 
432 caccacgcgg tcgagggacg cgaccattgc gtcagcttcg gcttcatcta tgcgccgccg 42 0 
434 gtcgacaagc tgaagcagga cctgaagacg tcgggcgtgc tgcgggccga ggcgcaggag 480 

E--> 436 cgcacgtga 

437 489 " ' " 

440 <210> SEQ ID NO: 8 

441 <211> LENGTH: 162 

442 <212> TYPE: PRT 

443 <213> ORGANISM: crtZ amino acid 

445 <400> SEQUENCE: 8 

446 Met Thr Asn Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu 

E--> 447 1 5 10 15 

449 Thr Ala Tyr Ser Val His Arg Trp lie Met His Gly Pro Leu Gly Trp 
E--> 450 20 25 30 

452 Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
E--> 453 35 40 45 

455 Asn Asp Leu Tyr Gly Leu Val Phe Ala Val lie Ala Thr Val Leu Phe 
E--> 456 50 55 60 

458 Thr Val Gly Trp lie Trp Ala Pro Val Leu Trp Trp lie Ala Leu Gly 
E--> 459 65 70 75 80 

461 Met Thr Val Tyr Gly Leu He Tyr Phe Val Leu His Asp Gly Leu Val 
E--> 462 85 90 95 

464 His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Lys Gly Tyr Ala Arg 
E--> 465 100 105 110 

467 Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/551,508 



DATE: 10/14/2005 
TIME: 11:04:23 



Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 



E- 




468 






115 






120 


125 






470 


His 


Cys 


Val Ser Phe Gly Phe 


He 


Tyr Ala Pro Pro 


Val Asp Lys Leu 


Cm 




471 




130 






135 




140 






473 


Lys 


Gin Asp Leu Lys Thr 


Ser 


Gly 


Val Leu Arg Ala 


Glu Ala Gin Glu 


E- 




474 


145 




150 




155 






476 


Arg 


Thr 
















7 


<210> SEQ ID NO: 10 














528 


<211> LENGTH: 386 














^9 Q 

D & 2? 


<212> TYPE: PRT 














■J J u 


<213> ORGANISM: crtY amino acid 












<400> SEQUENCE: 10 














jj J 


Val 


Thr 


His Asp Val Leu 


Leu 


Ala 


v" AT a Cl~\ v T ,f=»i i 
u± y a± ci vji y JjcU 


a ZXqti v Tif^n 
nici noil vj j. y Lieu 


15- 


- > 




1 

X 




5 






10 










Tin 

lie 


Ala 


Leu Ala Leu Arg 


Ala 


Al a 


Arg Pro Asp Leu 


Arg Val Leu Leu 


15- 


- > 


con 






20 






25 


30 






CTQ 
J J 3 


JjcU 


Asp 


His Ala Ala Gly 


Pro 


OCi. 


Asp Gly His Thr 


Trp Ser Cys His 


Js - 


- > 








35 






40 


45 






J 1 ! Z 


TV ot~\ 


Pro 


Asp Leu Ser Pro 


His 


Tr-n 
lip 


Leu Ala Arg Leu 


Lys Pro Leu Arg 


15- 


- > 


CAT 




50 






33 




60 






3*± 3 




Ala 


Asn Trp Pro Asp 


Gin 


ril ii 

ul U. 


Val Arg Phe Pro 


Arg His Ala Arg 


15- 


- > 


33 0 








70 






75 






3 *± o 




Leu 


Ala Thr Gly Tyr 


Gly 


gpr 


Leu Asp Gly Ala 


Ala Leu Ala Asp 


rs- 


- > 








85 






90 










Ala 


Val 


Ala Arg Ser Gly 


Ala 


Glu 

vJ J. 


He Arg Trp Asn 


Ser Asp He Ala 


15- 


— > 








100 






105 


110 








Leu 


Leu 


Asp Glu Gin Gly 


Ala 


Thr 


Leu Ser Cys Gly 


Thr Arg He Glu 


15- 


— > 


R Q R 

3 3 3 






115 






120 


125 








Ala 


Gly 


Ala Val Leu Asp 


Gly 


Arg 


Gly Ala Gin Pro 


Ser Arg His Leu 


15 — 


-> 


3 3 O 




130 




135 




140 






J u u 


Thr 


Val 


Gly Phe Gin Lys 


Phe 


Val 


Gly Val Glu He 


Glu Thr Asp Cys 


J5 — 


— > 


DDI 


145 




150 




155 






563 

J U J 


Pro 


His 


Gly Val Pro Arg 


Pro 


Met 


He Met Asp Ala 


Thr Val Thr Gin 












165 






170 








566 


Gin 


Asp Gly Tyr Arg Phe 


He 


Tyr 


Leu Leu Pro Phe 


Ser Pro Thr Arg 


E- 




567 






180 






185 


190 






569 


He 


Leu 


He Glu Asp Thr 


Arg 


Tyr 


Ser Asp Gly Gly Asn Leu Asp Asp 


is - 


- > 


3 / U 






195 






200 


205 






572 


Asp 


Ala 


Leu Ala Ala Ala 


Ser 


His 


Asp Tyr Ala Arg Gin Gin Gly Trp 


E- 


-> 


573 




210 




215 




220 






575 


Thr 


Gly Ala Glu Val Arg Arg 


Glu 


Arg Gly He Leu 


Pro He Ala Leu 


E- 


-> 


576 


225 




230 




235 






578 


Ala 


His Asp Ala Ala Gly Phe 


Trp 


Ala Asp His Ala Glu Gly Pro Val 


E- 


-> 


579 






245 






250 








581 


Pro 


Val 


Gly Leu Arg Ala Gly 


Phe 


Phe His Pro Val 


Thr Gly Tyr Ser 


E- 


-> 


582 






260 






265 


270 






584 


Leu 


Pro Tyr Ala Ala Gin Val 


Ala 


Asp Val Val Ala Gly Leu Ser Gly 


E- 


-> 


585 






275 






280 


285 






587 


Pro 


Pro Gly Thr Asp Ala 


Leu 


Arg 


Gly Ala lie Arg Asp Tyr Ala He 


E- 


- > 


588 




290 




295 




300 



160 



15 



95 



160 



175 
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RAW SEQUENCE LISTING DATE: 10/14/2005 

PATENT APPLICATION: US/10/551 , 508 TIME: 11:04:23 

Input Set : N:\FANTU\10551508.raw.txt 
Output Set: N:\CRF4\10142005\J551508.raw 

590 Asp Arg Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Met Leu 
E--> 591 305 310 315 320 

593 Phe Arg Gly Cys Ala Pro Asp Arg Arg Tyr Thr Leu Leu Gin Arg Phe 
E--> 594 325 330 335 

596 Tyr Arg Met Pro His Gly Leu lie Glu Arg Phe Tyr Ala Gly Arg Leu 
E--> 597 340 345 350 

599 Ser Val Ala Asp Gin Leu Arg lie Val Thr Gly Lys Pro Pro lie Pro 
E--> 600 355 360 365 

602 Leu Gly Thr Ala lie Arg Cys Leu Pro Glu Arg Pro Leu Leu Lys Glu 
E--> 603 370 375 380 

605 Asn Ala 

606 385 

609 <210> SEQ ID NO: 11 

610 <211> LENGTH: 1506 

611 <212> TYPE: DNA 

612 <213> ORGANISM: crtl gene 

614 <400> SEQUENCE: 11 

615 atgaacgccc attcgcccgc ggccaagacc gccatcgtga tcggcgcagg ctttggcggg 60 
617 ctggccctgg ccatccgcct gcagtccgcg ggcatcgcca ccaccctggt cgaggcccgg 120 
619 gacaagcccg gcgggcgcgc ctatgtctgg cacgatcagg gccatgtctt cgacgcgggc 180 
621 ccgaccgtca tcaccgaccc cgatgcgctc aaggagctgt gggcgctgac cgggcaggac 240 
623 atggcgcgcg acgtgacgct gatgccggtg tcgcccttct atcgactgat gtggccgggc 300 
625 gggaaggtct tcgattacgt gaacgaggcc gatcagctgg agcgccagat cgcccagttc 360 
62 7 aacccggacg acctggaagg ataccgccgc ttccgtgatt acgcggagga ggtgtatcag 42 0 
629 gagggctacg tcaagctggg caccgtgccc ttcctcaagc tgggccagat gctcaaggcc 480 
631 gcgcccgcgc tgatgaagct ggaggcctat aagtccgtcc atgccaaggt cgcgaccttc 540 
633 atcaaggacc cctatctgcg gcaggcgttt tcgtatcaca cgctgctggt gggcgggaat 600 
635 cccttctcga ccagctcgat ctatgcgctg atccacgcgc tggagcggcg cggcggggtc 660 
637 tggttcgcca agggcggcac caaccagctg gtcgcgggca tggtcgcgct gttcgaacgg 72 0 

. 639 cttggcggcc agatgatgct gaacgccaag gtcgcccgga tcgagaccga gggcgcgcgg 780 
641 accacgggcg tcaccctggc ggacgggcgg tctttaaggg ccgacatggt cgccagcaac 840 
643 ggcgacgtca tgcacaacta tcgcgacctg ctgggccaca cggcccgcgg gcagagccgc 900 
645 gcgaaatcgc tggaccgcaa gcgctggtcc atgtcgttgt tcgtgctgca tttcggtctg 960 
647 cgcgaggcgc ccaaggacat cgcgcatcac accatcctgt tcggcccccg ctacagggag 1020 
649 ctggtcaacg agatcttcaa gggcccgaag ctggccgagg atttctcgct gtacctgcat 1080 
651 tcgccctgca cgaccgatcc ggacatggcg cctccgggca tgtccacgca ttacgtgctg 1140 
653 gcccccgtgc cgcatctggg ccgcgccgag atcgattggg cggtcgaggg gccgcgctat -1200 
655 gccgaccgca tcctggcgtc cctggaggag cggctgatcc cgaacctgcg cgccaacctg 12 60 
657 accacgacgc gcatcttcac gcccgccgat ttcgccagcg aactgaacgc ccatcacggc 1320 
659 agcgccttct cggtcgagcc gatcctgacg caatccgcgt ggttccggcc gcacaaccgc 13 80 
661 gacaagacga tccgcaactt ctatctggtc ggcgcgggca cccatccggg cgcgggcatt 1440 
663 ccgggcgtcg tgggctcggc caaggccacg gcccaggtga tgctgtccga cctggcgggc 1500 
E--> 665 gcatga 
666 1506 

669 <210> SEQ ID NO: 12 

670 <211> LENGTH: 501 

671 <212> TYPE: PRT 

672 <213> ORGANISM: crtl amino acid 
674 <400> SEQUENCE: 12 



flf<$> 
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E--> 748 385 390 395 400 

750 Ala Asp Arg lie Leu Ala Ser Leu Glu Glu Arg Leu lie Pro Asn Leu 
E--> 751 405 410 415 

753 Arg Ala Asn Leu Thr Thr Thr Arg lie Phe Thr Pro Ala Asp Phe Ala 
E--> 754 420 425 430 

756 Ser Glu Leu Asn Ala His His Gly Ser Ala Phe Ser Val Glu Pro lie 
E--> 757 435 440 445 

759 Leu Thr Gin Ser Ala Trp Phe Arg Pro His Asn Arg Asp Lys Thr lie 
E--> 760 450 455 460 

762 Arg Asn Phe Tyr Leu Val Gly Ala Gly Thr His Pro Gly Ala Gly lie 
E--> 763 465 470 475 480 

765 Pro Gly Val Val Gly Ser Ala Lys Ala Thr Ala Gin Val Met Leu Ser 
E--> 766 485 490 495 

768 Asp Leu Ala Gly Ala 
E--> 769 500 

772 <210> SEQ ID NO: 13 

773 <211> LENGTH: 915 

774 <212> TYPE: DNA 
775 ? <213> ORGANISM: crtB gene 

777 <400> SEQUENCE: 13 

778 atgagcgatc tggtcctgac ctcgaccgag gcgatcaccc aagggtcgca aagctttgcc 60 
780 acggcggcca agctgatgcc gccgggcatc cgcgacgaca cggtgatgct ctatgcctgg 120 
782 tgccgccacg cggatgacgt gatcgacggt caggccctgg gcagccgccc cgaggcggtg 180 
784 aacgacccgc aggcgcggct ggacggcctg cgcgtcgaca cgctggcggc cctgcagggc 240 
786 gacggtccgg tgaccccgcc ctttgccgcg ctgcgcgcgg tggcgcggcg gcatgatttc 3 00 
788 ccgcaggcct ggcccatgga cctgatcgaa ggcttcgcga tggatgtcga ggcgcgcgac 360 
790 tatcgcacgc tggatgacgt gctggaatat tcctatcacg tcgcaggcat cgtcggcgtg 42 0 
792 atgatggccc gcgtgatggg cgtgcgcgac gatcctgtcc tggaccgcgc ctgcgacctg 480 
794 gggctggcgt tccagctgac caacatcgcg cgcgacgtga tcgacgatgc gcgcatcggg 540 
796 cggtgctatc tgccggggga ctggctggac caggcgggcg cgcggatcga cgggccggtg 600 
798 ccgtcgccgg agctgtacac agtgatcctc cggctgttgg atgaggcgga accctattac 660 
800 gcgtcggcgc gggtgggtct ggcggatctg ccaccgcgct gcgcctggtc catcgccgcc 720 
802 gcgctacgga tctatcgcgc catcgggctg cgcatccgca agagcgggcc gcaggcctat 780 
804 cgccagcgga tcagcacgtc caaggctgcc aagatcggcc tgctgggcgt cgggggctgg 840 
806 gatgtcgcgc gatcacgcct gccgggggcg ggcgtgtcgc ggcagggcct ctggacccgg 900 

E--> 808 ccgcatcacg tctag 

809 915> — 

812 <210> SEQ ID NO: 14 

813 <211> LENGTH: 304 

814 <212> TYPE: PRT 

815 <213> ORGANISM: crtB amino acid 

817 <400> SEQUENCE: 14 

818 Met Ser Asp Leu Val Leu Thr Ser Thr Glu Ala lie Thr Gin Gly Ser 
E--> 819 15 10 15 

821 Gin Ser Phe Ala Thr Ala Ala Lys Leu Met Pro Pro Gly lie Arg Asp 
E--> 822 20 25 30 

824 Asp Thr Val Met Leu Tyr Ala Trp Cys Arg His Ala Asp Asp Val lie 
E--> 825 35 40 45 

827 Asp Gly Gin Ala Leu Gly Ser Arg Pro Glu Ala Val Asn Asp Pro Gin 
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E--> 940 100 

942 Gly He Ala Leu He Thr Glu Ala 
E--> 943 115 

945 Gly Ala Ser Gly Thr Val Arg Ala 
E--> 946 130 135 

948 Ser Leu Gly Pro Gin Gly Leu Cys 
E--> 949 145 150 

951 Ala Ala Lys Asn Gly Ala Gly Val 
E--> 952 165 

954 Gly Val Leu Phe He Ala Gly Leu 
E--> 955 180 

957 Phe Asp Ala Glu Glu Gin Thr Gin 
E--> 958 195 

960 Gly Arg Val Phe Gin Ser Tyr Asp 
E--> 961 210 215 

963 Gin Ala Ala Leu Gly Lys Asp Thr 
E--> 964 225 230 

966 Pro Arg Arg Gly Leu Leu Ala Val 
E--> 967 245 

969 His Tyr Glu Ala Ser Arg Ala Gin 
E--> 970 260 

972 Arg Leu Gin Ala Pro Glu He Ala 
E--> 973 275 

975 Tyr Ala Ala Arg Ala 

976 290 
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Seq#:l; Line(s) 7,8 



file://C:\CRF4\OUTHOLD\VsrJ55 1508.htm 



10/14/05 



Page 11 of 12 



VERIFICATION SUMMARY DATE: 10/14/2005 

PATENT APPLICATION: US/10/551, 508 TIME: 11:04:24 
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L:13 M:270 C: Current Application Number differs, Replaced Current Application No 
L:13 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:106 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1454 SEQ:3 
L:355 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:729 SEQ: 
L:366 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:6 
M:332 Repeated in SeqNo=6 

L:436 M:254 E: No. of Bases conflict, LENGTH: Input : 0 Counted:489 SEQ:7^ 
L:447 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:8 - 
M:332 Repeated in SeqNo=8 / 
L:534 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:10</ 
M:332 Repeated in SeqNo=10 

L:665 M:254 E: No. of Bases conflict, LENGTH: Input : 0 Counted:1506 SEQ:11 
L:676 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:12_J/ 
M:332 Repeated in SeqNo=12 

L:808 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 915 SEQ: 13 
L:819 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 14 
M:332 Repeated in SeqNo=14 

L:922 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 16 
M:332 Repeated in SeqNo=16 
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